On The Capacity Of Time- Varying Channels 

With Periodic Feedback 

Mehdi Ansari Sadrabadi, Mohammad Ali Maddah-Ali and Amir K. Khandani 

Coding & Signal Transmission Laboratory (www.cst.uwaterloo.ca) 
Dept. of Elec. and Comp. Eng., University of Waterloo 
Waterloo, ON, Canada, N2L 3G1 
Tel: 519-884-8552, Fax: 519-888-4338 
e-mail: {mehdi, mohammad, khandani} @cst.uwaterloo.ca 

S" 
o , 

Abstract 

(N 

5^ ■ The capacity of time-varying channels with periodic feedback at the transmitter is evaluated. It is assumed that 

the channel state information is perfectly known at the receiver and is fed back to the transmitter at the regular 
time-intervals. The system capacity is investigated in two cases: i) finite state Markov channel, and ii) additive white 
Gaussian noise channel with time-correlated fading. In the first case, it is shown that the capacity is achievable 
by multiplexing multiple codebooks across the channel. In the second case, the channel capacity and the optimal 
adaptive coding is obtained. It is shown that the optimal adaptation can be achieved by a single Gaussian codebook, 
while adaptively allocating the total power based on the side information at the transmitter. 
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Index Terms 

Channel capacity, Gaussian channel, periodic feedback, time-correlated Rayleigh fading. 



> ' I. Introduction 

: 

Communications theory over time- varying channels has been widely studied from different perspectives 
^~ 1 I regarding the availability of the channel state information (CSI) at the transmitter and/or the receiver. 
q ■ Communication with perfect CSI at the transmitter is studied by Shannon in [1], where the capacity is 
C^- ■ expressed as that of an equivalent memoryless channel without side information at either the transmitter 
or the receiver. Communication with perfect CSI at the receiver is investigated, for example, in [2]. 
With the assumption of perfect CSI at both the transmitter and the receiver, the capacity of finite state 
Markov channels (FSMCs) and compound channels is studied in [3] and [4], respectively. In practice, the 
• £h . assumption of perfect CSI is not practical due to estimation inaccuracy, limited feedback channel capacity, 
<S [ or feedback delay. Communication with imperfect side information is well investigated in the literature 
c3 ' [5]-[8]. In [5], the capacity of FSMCs is evaluated based on the assumed statistical relationship of the 
channel state and side information at the transmitter. The channel capacity, when feedback delay is taken 
into account, is studied in [9], [10]. The optimal transmission and feedback strategies with finite feedback 
alphabet cardinality is investigated in [11]. 

In this paper, we consider a point-to-point time- varying channel with perfectly known CSI at the receiver. 
It is assumed that the channel is constant during a channel use and varies from one channel use to the 
next, based on a Markov random process. The CSI is provided at the transmitter through a noiseless 
feedback link at regularly- spaced time intervals. Every T channel use, the CSI of the current channel use 
is fed back to the transmitter. We obtain the channel capacity of the system and show that it is achievable 
by multiplexing T codebooks across the channel. It is worth mentioning that for FSMCs, the results of 
[5] apply directly to compute the channel capacity, if the side information at the transmitter and receiver 
are jointly stationary. However, in our model, the side information at the transmitter is not stationary. 

Adaptive transmission is an efficient technique to increase the spectral efficiency of time-varying wireless 
channel by adaptively modifying the transmission rate, power, etc., according to the state of the channel 
seen by the receiver. Adaptive transmission, which requires accurate channel estimates at the receiver and 
a reliable feedback path between the receiver and transmitter, was first proposed in the late 1960's [12]. 



2 



A variable-rate and variable-power MQAM modulation scheme for high-speed data transmission over 
fading channels is studied in [13] [14], where the transmission rate and power are optimized to maximize 
the spectral efficiency. We utilize the introduced feedback model to obtain the capacity of additive white 
Gaussian noise (AWGN) channel with time-correlated fading. It is shown that the capacity is achievable 
using a single codebook with adaptively allocating power based on the side information at the transmitter. 
Also, the optimum power allocation is derived. 

The rest of the paper is organized as follows. In Section [TH the system model is described and the 
channel capacity is obtained. The capacity of time-correlated fading channel with periodic feedback is 
derived in Section [TTTJ The impact of channel correlation and feedback error on the capacity is evaluated 
in Section [IV] Finally, the paper is concluded in Section [V] 

Throughout this paper, upper case letters represent random variables; lower case letters denote a 
particular value of the random variable; d 1 ^ represents the sequence (a m , ■ • • , a n ) and a* is the complex 
conjugate of a. 



II. Markov Channel With Feedback State 

We consider a channel with discrete input X n 6 X and discrete output Y n E y at time instant n. The 
channel state is characterized as a finite-state first order Markov process: 

Pr(u n K~ fe ) =Pr(u n \u r ^ k ) (1) 

The state process, U n E U, is independent of the channel input and channel output: 

N 

Pr(xf,yf|<) = l[Pr(x n ,y n \u n ). (2) 

n=l 

It is assumed that CSI is perfectly known at the receiver. The CSI is provided at the transmitter through 
a noiseless feedback link periodically at every T symbols, i.e., Ui, Ut+i, C/zr+i, • • • are sent over the 
feedback link and instantly received at the transmitter. Assume that the codeword length, N, is an integer 
factor of T and M = f . Let us define VJ = U T(i . 1)+1 for 1 < i < M and h = [f J + 1. 

Encoding and Decoding: Assume that W E VV is the message to be sent by the transmitter and 
A w = 2 NR is the cardinality of W. A codeword of length N is a sequence of the encoding function ip n 
which maps the set of messages to the channel input alphabets. The input codeword at time n depends 
on the message w and the CSI at the transmitter up to time n, i.e. v™, 

x n = <p n (w,v"). (3) 

The decoding function, (ft, maps a received sequence of iV channel outputs using CSI at the receiver to 
the message set such that the decoded message is w = (j){yf, u^). 

Theorem 1 The capacity of a finite state Markov channel with periodic feedback is given by 

1 T 

r E E Pr ^ ^ x E P Mv)I{X; Y\u, v), (4) 

1 f—* *■ — ' qt(x\v) * — ' 

1=1 V u 

where T is the feedback period, P t (u\v) = Pr u ^ u ._ t+1 (u\v) and q t {x\v) is the random coding probability 
distribution function (PDF) parametrized with subscript t to reflect the dependency on time. 
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A. Achievability 

We state a result on the capacity of FSMCs, which we then apply in the proof. It is shown that the 
capacity of FSMCs with perfectly known CSI, U, at the receiver and side information V at the transmitter 
is [5] 

C = VPr(V)max S2~Pi(u\v)I(X]Y\u,v), (5) 

' J q(x\v) ' ' 

V u 

where U and V are jointly stationary and ergodic with joint PDF Pr({7, V) and V is a deterministic 
function of U. 

We consider the channel as T parallel subchannels where the t th subchannel (1 < t < T) occurs in 
time instances (i— l)T + t, 1 < % < M. Noting that the channel state of the t th subchannel {£/($._ i)T+t}|£i 
and the side information at the transmitter {l^}^! = {f/(j_i)T+i}^£i are jointly stationary and ergodic, 
we define P t (u\v) = Pr Mi | Ul _ t+1 (ii|v) for 1 < t < T. Using ©, the achievable rate of the t th subchannel 
is 

Rt = y\ Pr(«) max V P t (u\v)I(X; Y\u, v). (6) 

— ' qt(x\v) — ' 

V u 

T codebooks are designed corresponding to R t for 1 < t < T and multiplexed across the T subchannels, 
i.e., at time instants (i — 1)T + 1 for 1 < i < M, the channel inputs from the t th codebook are sent over 
the channel. Therefore, the achievable rate is 

1 T 

R= - VVPr(?)) max V P t (u\v)I(X;Y\u,v). (7) 

T ^—f ' qt{x\v) ' 

1 = 1 V u 



B. Converse 

In this part, we prove the converse to the capacity theorem. The proof is motivated by the proof in [5]. 
From the Fano's inequality [15], we have 

H(W\Yf, C/f) < P e log A w + h(P e ) = Ne N , (8) 

where P e = Pr(W ^ W) and e N -> as N -> oo. 

H(W\Y N ,U?) = H(W\U?)-I(W-Y*\U?) 

= NR — I(W; Y^\U±). (9) 

Using © and ©, we can write 

R< -^I(W;Y 1 N \U?) + e N . (10) 
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Then we have, 



I(W;Y?\U^ 

N 



n=l 
N 



= H(Y n \U N , Yr 1 ) - H(Y n \Uf, Yr\ W) 

n=l 
N 

< H(Y n \U n , \f) - H(Y n \U N , Yr\ W) 

n=l 
N 

< H(Y n \U n , Vf ) - H(Y n \U n , X n , Vf ) (11) 

n=l 
iV 

= Y,I(X n ;Y n \U n ,V?), (12) 



n=l 



where (a) follows from the fact that the channel output is independent of the message and past channel 
outputs given the state of the channel and the channel input. On the other hand, for a given n, we have 

I(X n ; Y n \U n , V™) 

=J2 ; Pr^J^^r^PrK-V^Pr^)/^;^!^,^" 1 ,^) 
=Y Pr(u„|^)Pr(^) ^PrK"> fi )/(X n ;r n | Mn ,<- 1 ,^) 



u„,v A v n- 
c 



<\ Pr(w n |^)Pr(u ft ) max I(X n ; Y n \ ), (13) 

q(x n \v A ) 

where (b) follows from the property in (OQ), and (c) results from the concavity of mutual information with 
respect to the input distribution, and q(x n \vn) = 5^« A_1 ^ r ( v i~ l \ v n)^(x n \v^). Replacing n = (n — l)T + t 

r N I j jN\ 



in (1131) and using (fT2l) . we have 



M T 

(n— l)T+t I )Pr(^n) x 

n=l (=1 u fi « (ft _ 1)T+t 

max J(X( ft _i) r+t ; F( fi _i) T+ t|'U( ft _i) T+t , 

?(a!(n-l)T+tl«a) 

(14) 

T 

=MV VP t («|?;)Pr(t)) max J(X;F|m,v), (15) 

T~f ' Qt(x\v) 

i=l 

where (fT5l) follows from the fact that {V*}^ and {f7( i _ 1 ) T , +t are jointly stationary and ergodic and 
the right-hand side of (fl4l) does not depend on n. Using (flOl) and (fT5l) . we have 

1 T 

R<7^^yy^ Pr(v) max V P t (u|v) J(X; 1>, u) + e^. (16) 
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III. Gaussian Channel 

In this section, we consider a point to point transmission over a time-correlated fading channel. It is 
assumed that the channel gain is constant over each channel use (symbol) and varies from symbol to 
symbol, following a first order Markovian random process. The signal at the receiver is 

fn h n x n -\- z n , (17) 

where h n G C is the fading gain and z n is AWGN with zero mean and unit variance. It is assumed that 
the CSI is perfectly known to the receiver. Every T channel use, the instantaneous fading gain is sent 
to the transmitter through a noiseless feedback link, i.e., \hi\, \h T +i\, • • • , |^(m-i)t+i| are fed back and 
instantly received at the transmitter. 

Let us define u n = \h n \ 2 for 1 < n < N, Vi = \h^i) T+ i\ 2 for 1 < i < M and P t {u\v) = Pr Ui \ Ui ^ t+1 (u\v). 
The average input power is subject to the constraint E[|x n | 2 ] < V. In the following, E t [g(U, V)] denotes 
the expectation value over g{u,v) where U and V have joint PDF P t (u,v). 

Theorem 2 The capacity of time-correlated fading channel with periodic feedback is 

1 T 

max-^E t [log(l + l^(VO)], (18) 
PvPt 1 

subject to ^ Ylt=i ^[Pt(V)] — where T is the feedback period. 

First, we recount some results on the capacity of single user channels, which is applied in the proof. 
A general formula for the capacity of single user channels which is not necessarily information stable 
or stationary is obtained in [16]. Consider input X and output Y as sequences of finite-dimensional 
distribution, where Y is induced by X via a channel which is an arbitrary sequence of finite-dimensional 
conditional output distribution from input alphabets to the output alphabets. The general formula for the 
channel capacity is as follows: 

C = sup I(X;Y), (19) 
x 

where I_(X; Y) is defined as the liminf in probability of the normalized information density [16] 

M ^;y 1 ") = llog gg™ . (20) 

Assume that the channel state information, Q, is available at the receiver. Considering Q as an additional 
output, the channel capacity is C = sup x I_(X] Y, Q). If Q is not available at the transmitter and is 
consequently independent of X, then the capacity is [17] 

C = sup/(X;y|Q), (21) 

x 

where I_(X; Y\Q) is the liminf in probability of the normalized conditional information density 

i N (X, , Y x \Q X ) - - log pr{Yfm ■ (22) 
Now, we are ready to prove Theorem 2, where the proof is motivated by the proof in [5]. 
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A. Achievability 

Noting (fTTl) . the processed received signal at time n is 



h* 

Vn ^nT~, f \hfi \%n (23) 



where z' n = j^z n , which has the same distribution as z n . The transmitter sends 



(24) 



over the channel where s n is an i.i.d. Gaussian codebook with zero mean and unit variance, and p n 
IR + — > M + is the power allocation function. Using (1231) and (|24j) . we can write 



Vn = y/q^s n + z' n , (25) 



where q n = p n (vn) \h n \ 2 = p n (vh)u n . Noting (|25l) . we have a channel with input S and output Y and 
channel state Q, which is known at the receiver. Since is independent of S^, we can use (12TI) to 
obtain the achievable rate. 



N & P^lQf) 



AT 

(26) 



ig( log(1+Q „ ) + J|£__ K|2 ) 



where (d) results from the fact that and Z\ are i.i.d. sequences and the last line follows from the 
fact that Y n conditioned on Q n is Gaussian with zero mean and variance 1 + Q n . Note that as N — > oo, 
Ji Yln=i i+q = Jf En=i l^nl 2 = ^- w ^ tn probability one. Therefore, with probability one, we have 

1 N 

= -£> g (i + Q n ) 

n=l 



T M 

MT 



^EX>s(i + Q(*-i)T+^ 



t=l i=l 
T „ M 



= t S Tf E lo S ( X + U^T+tPii-DT+tiVi)) . (27) 
t=i i=i 

Noting that {£7( i _ : n T+t }££ 1 and {V A j}fi 1 are jointly stationary and ergodic for 1 < £ < T, we define 
P t (u, u) to be their joint PDF. We set p(i-x) T +t = Pt for 1 < * < M and 1 < t < T. As M -> oo in (l27l) . 
the sample mean converges in probability to the expectation. Therefore, the achievable rate is 

1 T 

fi=-^E ( [log(l + f/ft(F))]. (28) 
t=i 
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B. Converse 

Using (fTTI) . we have 



N 



I(W; iflC/f) < H(Y n \U n , V?) - H(Y n \U n , X n , Vf ) 



n=l 
N 



<^E[log(l + f/ n E[|X n | 2 |Vf])]. (29) 



n=l 



The above inequality relies on the facts that 

H(Y n \U n ,X n ,V?) = H(Z n ) = \og2ne (30) 

and 

H(Y n \U n , V?) < E[log(27re(l + U n E[\X n \ 2 \V; i ]j)]. (31) 

The upper-bound in (|3T| ) is achieved if X n conditioned on V™ has a Gaussian distribution. We set x n = 
a/ fn{vi)s n where / n : R" — ► R + and is an i.i.d. Gaussian sequence with zero mean and unit variance. 
On the other hand, 

E[log(l + U n f n (V? ))] = E[E[log(l + U n f n (V?))\U n , V n ]] 

< E[log(l + E[U n f n (V±)\U n , Vn])] 

= E[log(l + U n E[f n {V?)\V n ])], (32) 

where (d) follows from the concavity of the logarithm. Let us define p n (V n ) = E[/ n (l/ 1 n ))|^4]. By using 
(1291 and we obtain 

^/(W;lf|t^) 



<i^E[log(l + C/ nPn (F ft ))] 



n=l 



f E 77 E E t 1 °g( 1 + U^T+tPd-DT+tiVi))] 



T ^ M 

t=l i=l 



T M 

< - E [!og(l + T7 E (33) 



t=l i=l 



Using (1331) and noting the fact that that {U^-i)T+t}fLi an d are jointly stationary and ergodic for 

1 < t < T, we can write 

T 

-J(W; Y X N \U» ) < - £ E *[log(l + Up t (V))], (34) 



^ V L J — rp 

t=l 



where p t (.) = P(i-i)T+t(-)- Combining (flOl) and (1341) , we conclude that 



1 T 

H<-^E t [log(l + t/p^))], (35) 
t=i 

subject to ±EL E [Pt(^)l <P- 

■ 

Remark: In Section HH we prove that the capacity of Markov channels is generally achieved by using 
multiple code multiplexing technique. However, for AWGN channel with time-correlated fading, the proof 
relies on using one Gaussian codebook, where the symbols are adaptively scaled by the appropriate power 
allocation function based on the side information at the transmitter. 
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IV. Performance Evaluation 

We study the impact of the channel correlation and feedback period on the capacity of the time-correlated 
Rayleigh fading channel. Let us assume that time-correlated Rayleigh fading channel is a Markov random 
process with the following PDF [18]: 

p / x _ f er u u > 

\ / in Atnf»ru;i co ^ ' 



0. otherwise 

Pi(u\v) = 5(v) 



P t (u\v) = Mu^a*- 1 ), (37) 



where 



I^exp(-^)j (^) u>0, 



$(u,v,a)=l l-<^ 2 ^ V l ~° 2 J ° V l ~° 2 ' ~ ' (38) 

[ otherwise. 

In (1381) . < a < 1 describes the channel correlation coefficient and X (.) denotes the modified Bessel 
function of order zero. Noting that the capacity in (fl"8l is a strictly concave region of p t , 1 < t < T, 
we numerically solve the convex optimization problem. In Figure [T] the capacity is depicted versus the 
feedback period for various channel correlation coefficients and compared to the capacity when no CSI 
is available at the transmitter. 




Time interval 



Fig. 1. Capacity of time-correlated Rayleigh fading channel versus T for SNR = 1 and channel correlation coefficients a = 
0.97, 0.95, 0.9, 0.8. The dash-dot line is the capacity with no side information at the transmitter. 



V. Conclusion 

We have obtained the capacity of finite state Markov channel with periodic feedback at the transmitter. 
Also, the channel capacity and optimal adaptive coding is derived for the time-correlated fading channel 
with periodic feedback. It is shown that the optimal adaptation can be achieved by a single Gaussian 
codebook, while scaling by the appropriate power. 
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